Unravelling the Architecture of Duplications in Tumor Genomes
نویسنده
چکیده
Tumor cells frequently exhibit large-scale changes in their genome, including genome rearrangements (e.g. chromosome inversions and translocations), duplications, and deletions. Such genomic changes are directly implicated in the pathogenesis of cancer and can alter gene structure and regulation. While many individual rearrangements and duplications in tumors have been identified, little is known about the detailed architecture of tumor genomes. A recently introduced technique called End Sequence Profiling (ESP) allows the identification of both rearrangements and duplications in tumor genomes at high resolution [5, 3]. In contrast, array-based techniques such as comparative genomic hybridization (CGH) are very useful in the identification of duplicated material in a tumor genome, but give little information about the organization and locations of duplicated material within the genome. For example, it is difficult to infer from CGH data whether two duplicated regions are located close together (or even on the same chromosome) in the tumor genome. In some tumors, duplicated material from several disparate regions of the human genome is co-localized [1, 6]. However, the precise architecture of these complex duplications and the mechanisms that explain their formation are not completely understood. We develop a computational method that allow us to unravel the architecture of complex duplications in a tumor genome using ESP data. The technique relies on a model of breakagefusion-bridge (BFB) cycles, a biological process that produces both rearrangements and duplications in tumor genomes. We apply our method to ESP data from the MCF7 breast tumor cell line. This work complements earlier analyses of genome rearrangements [3] and duplication by amplisome [4] using ESP data.
منابع مشابه
O-44: Characterisation of Monotreme CaseinsReveals Lineage Specific Expansion of an AncestralCasein Locus in Mammals
Background: One important reproductive characteristic of Mammals is the production of milk to nurse the neonate. In order to better understand the evolution of milk we have investigated gene expression in milk cells from monotremes which are the most ancient representative of the mammalian lineage. Materials and Methods: Using a milk cell cDNA sequencing approach we characterise milk protein se...
متن کاملDecoding the Genomic Architecture of Mammalian and Plant Genomes: Synteny Blocks and Large-scale Duplications
Motivation: The existing synteny block reconstruction algorithms use anchors (e.g., orthologous genes) shared over all genomes to construct the synteny blocks for multiple genomes. This approach, while efficient for a few genomes, cannot be scaled to address the need to construct synteny blocks in many mammalian genomes that are currently being sequenced. The problem is that the number of ancho...
متن کاملDecoding Synteny Blocks and Large-Scale Duplications in Mammalian and Plant Genomes
The existing synteny block reconstruction algorithms use anchors (e.g., orthologous genes) shared over all genomes to construct the synteny blocks for multiple genomes. This approach, while efficient for a few genomes, cannot be scaled to address the need to construct synteny blocks in many mammalian genomes that are currently being sequenced. The problem is that the number of anchors shared am...
متن کاملOrigins and Impacts of Primate Segmental Duplications
Duplicated sequences are substrates for the emergence of new genes and are an important source of genetic instability associated with rare and common disease. Analyses of primate genomes have shown an increase in the proportion of interspersed segmental duplications within the genomes of humans and great apes. This is in contrast to other mammalian genomes that appear to have their recently dup...
متن کاملA Survey of Innovation through Duplication in the Reduced Genomes of Twelve Parasites
We characterize the prevalence, distribution, divergence, and putative functions of detectable two-copy paralogs and segmental duplications in the Apicomplexa, a phylum of parasitic protists. Apicomplexans are mostly obligate intracellular parasites responsible for human and animal diseases (e.g. malaria and toxoplasmosis). Gene loss is a major force in the phylum. Genomes are small and protein...
متن کامل